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QUILLEN PROPERTY 
OF REAL ALGEBRAIC VARIETIES 

MIHAI PUTINAR AND CLAUS SCHEIDERER 



Abstract. A conjugation-invariant ideal I C C[zj,Zj: j — 1, ...,n] 
has the Quillen property if every real valued, strictly positive polynomial 
on the real zero set Vk(7) C C n is a sum of hermitian squares modulo I. 
We first relate the Quillen property to the archimedean property from 

£f~) . real algebra. Using hereditary calculus, we then quantize and show that 

the Quillen property implies the subnormality of commuting tuples of 
Hilbert space operators satisfying the identities in /. In the finite rank 
case we give a complete geometric characterization of when the identities 

•^L m I imply normality for a commuting tuple of matrices. This geometric 

interpretation provides simple means to refute Quillen's property of an 
ideal. We also generalize these notions and results from real algebraic 



sets to semi-algebraic sets in < 



> . 1. Introduction 

l> 

t^J- ■ On any (affine) real algebraic variety V there exists a natural source for 

positivity certificates, namely squares (of regular functions): Any square, 
and hence any sum of squares, is nonnegative whereever it is defined on the 
IR-points of V. This observation lies at the very basis of real algebra, starting 
with Hilbert's 17th problem and its solution by Artin. Today the polarity 
between positivity and sums of squares is the focus of intense research, both 
from theoretical and applied points of view. See [TT] and [19] for recent 



surveys. 

rN ■ In the present article we consider real algebraic subvarieties V of complex 

affine space. The embedding in complex space provides V with additional 
structure and gives the notion of holomorphic (and antiholomorphic) ele- 
ments in the complexified structural rings of V. Accordingly we get a sec- 
ond, more restricted kind of positivity certificate, namely sums of hermitian 
squares on V, that is, of squared absolute values of holomorphic polynomials 
restricted to V. Our aim is to study this notion from the points of view of 
real algebra, geometry and operator theory. 

We work with several complex variables z = (z±, . . . , z n ) and their conju- 
gates z = (zi, . . . ,z n ). Let I C C[z,z] be a conjugation-invariant ideal, and 
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let Vr(7) C C n be its zero set, a real algebraic subset of C n . Let p G C[z,z] 
be a conjugation-invariant polynomial that is non-negative on Vr(I). We 
study the question whether p admits an identity 

p(z,z) = |/ii(z)| 2 + --- + |/ Jr (z)| 2 + 5 (z,z) (1) 

with g E I, in which hi, . . . , h r G C[z] are holomorphic polynomials. When 
such an identity exists we will say that p is a sum of hermitian squares 
modulo I. 

A classical instance where this property holds is the case of the unit circle 
T C C and its vanishing ideal I = (zz — 1). According to the Riesz-Fejer 
theorem, any p E C[z,z] non-negative on T is a single hermitian square 
p = \h(z)\ 2 modulo /. 

The first multivariate example with such a property was discovered almost 
half a century ago by Quillen [16]. He studied the unit sphere § C C n and 
its reduced ideal I, and showed that any p strictly positive on § is a sum of 
hermitian squares modulo /. 

Quillen's theorem amounts to a Positivstellensatz on the sphere vis-a- 
vis sums of hermitian squares, rather than ordinary squares. It is our aim 
to prove this result in greater generality, and to study the algebraic and 
geometric implications of such a result. Although our approach is basically 
algebraic, the interlacing with Hilbert space methods and operator theory 
is a recurrent theme of our study. 

Fixing a conjugation-invariant ideal / C C[z,z], we will say that / has 
the Quillen property if the Positivstellensatz holds for hermitian sums of 
squares modulo /. Assuming that Vr(7) is compact, an abstract charac- 
terization of this property comes from real algebra (Proposition I3.2J ). This 
characterization, however, is often not explicit enough. An improvement, on 
the constructive side, is offered by a known link to operator theory. Specif- 
ically, given p G C[z,z], and given a commuting tuple T = (T\, . . . ,T n ) of 
bounded linear operators on a Hilbert space, define the operator p(T, T*) us- 
ing hereditary calculus, thereby putting all adjoints to the left. We consider 
the following properties of the ideal I: 

(A) (Archimedean property) c — X^?=i \ z j\ 2 ^ s a sum of hermitian squares 
modulo I, for some real number c. 

(Q) (Quillen property) Every conjugation-invariant polynomial strictly 
positive on Vr(I) is a sum of hermitian squares modulo I. 

(S) (Subnormality) Every commuting tuple T of bounded operators on a 
Hilbert space and satisfying f(T,T*) = for all / G / is subnormal. 

(Sf) (Finite rank subnormality) Every commuting tuple T of operators 
acting on a finite- dimensional Hilbert space and satisfying f(T,T*) = for 
all / G I is subnormal (hence normal), 
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(G) (Geometric normality) The ideal I is not contained in the any of the 
"diamond" ideals 

I(a, b) = {/eC[z,z]: f( a ,a) = f(a,b) = f(b,a) = f(b,b) = 0} 

for a 7^ b in C n , and neither in any of their degenerations J(a, U), see 14.21 
We prove the implications 

(A) =► (Q) => (S) => (Sf) ^ (G), 

(A) <£= (Q) if Vr(I) is compact, 

see 13.21 13.91 and 14.41 We also analyze by means of examples why the missing 
implications do not hold. For instance, even real conies in C offer nontrivial 
features (|5.4| I5.8J) : A circle satisfies (A) , an eccentric ellipse has property 
(S) but not (Q), the non-reduced ideal of a circle with a double point satis- 
fies (Sf) but not (S), and a hyperbola whose asymptotes are perpendicular 
doesn't satisfy (Sf). 

We then extend the study of hermitian Positivstellensatze from real alge- 
braic sets to semi-algebraic sets in C n . To this end we replace the semiring 
of hermitian sums of squares mod I by a hermitian module M , and the real 
algebraic set Vr(I) by the semi-algebraic set Xm C C n associated with M. 
Defining properties (Q), (S) and (Sf) for M accordingly, the implications (Q) 
=>■ (S) =4> (Sf ) remain true. When M is archimedean and satisfies a polyno- 
mial convexity property, the reverse (S) =4> (Q) holds true as well (Theorem 
I6.16p . When M is finitely generated, we prove that the Quillen property is 
incompatible with Xm containing an analytic disc (Theorem 16. 20p . In this 
direction we mention article [6] , where a notion of hermitian complexity was 
introduced for conjugation-invariant ideals with the precise aim of bridging 
the gap between Quillen's property at one end and the existence of analytic 
discs in the support at the other. 

At the end of the paper we make a few historical comments putting this 
work into perspective, mentioning some of the analytic roots and applica- 
tions of hermitian sums of squares. 

2. Preliminaries and notation 

2.1. Let C[z,z] be the polynomial ring in 2re independent variables z = 
(z\, . . . ,z n ) and z = (~z~i, . . . ,~z n ). On C[z, z] we consider the C/M-involution 
z* = Zj (j = 1, . . . , n). Thus 

(J> Qi/3 z^)* = ^o^z^z" 

for a a fl S C, with the usual multi-index notation z a z^ = YYj=i z ? z / ■ The 
fixed ring of * is the polynomial ring M[x,y] generated by the 2n variables 
x = (xi,...,x n ) and y = (j/i, ... ,y n ), where Xj = \(zj +Zj) and yj = 
Yi( z i ~ Zj) (and i = \J — 1). Thus C[z,z] is identified with R[x,y] <g) C, and 
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under this identification, the involution * becomes complex conjugation in 
the second tensor component. 

Given / G C[z,z] and a,b G C n , we write f(a,b) G C for the result of 
substituting a for z and b for z. We often abbreviate f(a) := f(a,a). 

2.2. There is a one-to-one correspondence between *-invariant ideals J of 
C[z,z] and arbitrary ideals / of R[x, y], given by J t— > I := JnM[x,y]. Given 
an ideal / of R[x, y], we denote the zero set of I in C n by 

Vk(I) := {aGC":V/G//(a) = 0}. 

This is a real algebraic subset of C n . 

2.3. For every p G C[z,z], the hermitian norm \p\ 2 := pp* is a sum of two 
(usual) squares in R[x, y]. The convex cone in R[x, y] generated by {\p\ 2 : p G 
C[z,z]} will be denoted by E; it is the cone of all (usual) sums of squares 
in R[x,y]. The smaller convex cone in R[x,y] generated by {|p| 2 : p G C[z]} 
is denoted by E^. Its elements are called the hermitian sums of squares. 

2.4. We recall a few notions from real algebra. Given an R-algebra A (i.e., 
a commutative ring containing R), a subset S C A will be called a semiring 
in A if 5 contains the nonnegative real numbers and is closed in A under 
taking sums and products. Given a semiring S, an S-module is a subset M 
of A with M + M C M, SM C M and 1 G M. A particularly important 
semiring is EA 2 , the set of all (finite) sums of squares in A. The modules 
over this semiring are usually referred to as the quadratic modules in A. 

The 5-module M is said to be archimedean if A = R + M, that is, if for 
every f £ A there exists c G R with c ± / G M. 

In this paper we will mostly be concerned with the R-algebra A = R[x, y] 
and with the two semirings E^ C E in R[x, y]. 

2.5. Given a module M over some semiring S in R[x, y], we write 

X M := {a G C n : V 5 G M g{a) > 0}, 
which is a closed subset of C n . 

The celebrated archimedean Positivstellensatz from real algebra (see |12| 
or |19| ) implies: 



Theorem 2.6. If M is a module over an archimedean semiring S in R[x, y] ; 
then M contains any f G R[x, y] that is strictly positive on the set Xm- 

3. Hermitian sums of squares and subnormal tuples of 

operators 

3.1. Let R[x,y] C C[z,z] be the fixed ring of * fsee !2.ip . Given any ideal 
/ C R[x, y], we will consider the semiring S = E^ + 1 in R[x, y]. Note that 
Xs = Vr(I)- Consider the following two properties of the ideal /: 

(A) (Archimedean Property) The semiring E^ + I in R[x,y] is archi- 
medean (see !2.4j) : 
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(Q) (Quillen Property) E^ + 1 contains every / G R[x, y] that is strictly 
positive on Mr (I). 

We'll also refer to (A) by saying that T,^ is archimedean modulo /. 

Given any *-invariant ideal J C C[z,z], we will say that J has property 
(A) resp. (Q) if the ideal J n R[x, y] of R[x,y] has the respective property 
(see[22D. 

The following result was proved in [J3] (Theorem 2.1 and Proposition 
2.2). It is essentially an application of the archimedean Positivstellensatz 



Proposition 3.2. For any ideal I C R[x,y], the following are equivalent: 

(i) / has the Archimedean property (A); 
(ii) / has the Quillen property (Q), and Vr(I) is compact; 
(iii) / contains a polynomial of the form \\z\\ 2 +p + a, where peE;, and 
a£K. 

(We are using the shorthand ||z|| 2 := \z\\ 2 + • • • + \z n \ 2 .) 

Remarks 3.3. 

1. Quillen's theorem [16], reproved later by Catlin-D'Angelo [3], was men- 
tioned in the introduction. The statement is recovered here in a purely 
algebraic way, as a very particular instance of Proposition 13.21 

As observed in [3], Quillen's theorem implies the following classical the- 
orem due to Polya: Given a homogeneous polynomial / € R[xi, . . . ,x n ] 
strictly positive on {a G M n : a>\ > 0, . . . , a n > 0} n {(0, . . . , 0)}, the form 
(x\ + • • • + x n ) N f has positive coefficients for large enough N > 0. 

2. Condition (iii) of 13.21 gives an abstract algebraic characterization of 
the ideals I with Vr(I) compact and with property (Q). Note that the Posi- 
tivstellensatz for usual sums of squares holds whenever Vk(J) is compact, by 
Schmiidgen's theorem [20]. In contrast, "most" ideals with Vk(I) compact 
do not satisfy property (Q) (see, e.g.. 15.41 below). 

The applicability of I3.2f iii) as an algebraic criterion for property (Q) is 
somewhat limited, since this condition is not sufficiently explicit. In par- 
ticular, it is usually cumbersome to prove that an ideal I does not contain 
any polynomial of the form given in (iii). Therefore it is desirable to know 
other conditions on / that are necessary for (A) resp. (Q), and that are more 
easily checked. In this section and the next we will offer two conditions of 
very different nature that are both necessary for the Quillen property, one 
operator-theoretic and one ideal-theoretic. 

3. Part of the original motivation for this work came from a question of 
D'Angelo. Given a compact real algebraic set X C C n which is the boundary 
of a strictly pseudo-convex region in C n , D'Angelo had asked whether every 
strictly positive polynomial on X is a sum of hermitian squares on X. This 
question was answered in the negative, see ^\A 
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Examples 3.4. 

1. The Quillen property (Q) alone does not imply the archimedean prop- 
erty (A), since Vk(7) need not be compact. This is seen by considering a line 
in C, given (say) by the ideal / = (y) C M.[x, y]. Condition (Q) is satisfied 
since, in fact, S^ + / contains every / G R[x,y] nonnegative on the line 
y = 0. Indeed, such / is a sum of two usual squares modulo /, from which 
one sees easily that / is congruent modulo I to a single hermitian square, 
i.e., / = \p\ 2 (mod I) with p E C[z\. 

2. If n = 1 and / £ R[x, y] has degree 2, the principal ideal / = (/) 
satisfies the Archimedean property (A) if and only if there exist qgC and 
a,c£l with / = a\z — a\ 2 + c. This will be proved in Theorem 15.41 below. 

3.5. Let E be a (separable complex) Hilbert space, and let B(E) denote 
the algebra of bounded linear operators on E. Fix a tuple T = (T\, . . . ,T n ) 
of operators Tj G B(E) that commute pairwise. We use hereditary calculus 
(see [1] Section 14.2 for more details). Given a monomial / = z Q z^ (with 
a, j3 G Z") we write 

We extend this definition C-linearly, thereby putting all adjoints to the left. 
This defines the C-linear map 

i> T : C[z,z] -)> B(E), /(z,z) h- fa(f) = f(T,T*). 

The map tJjt commutes with the involution, i.e. ipr{f*) = Vt(/)*- in P ar " 
ticular, tprif) is self-adjoint for / = /*. Note that 



tpT[q(z) -/(z,z) -p(z)J = ^T(q)*^T(f)4>T{p) 
for p, q £ C[z] and / 6 C[z, z]. The set 

M T := {/Gl[x,y]:f(/)>0} 

(of real polynomials / for which the self-adjoint operator Vt(/) is nonneg- 
ative) is a I^-module in M[x,y], since ipT(\p\ 2 f) = V ; r(p(z)/(z, z)p(z)) = 
p(T)*7p T (f)p(T) holds for / € C[z,z] und p G C[z]. The support M T D 
(_M T ) = ker(V'T) of M T is an ideal in M[x,y]. Note that the subset M T of 
M[x, y] is closed with respect to the finest locally convex topology on R[x, y]. 

3.6. Recall that the tuple T is said to be (jointly) subnormal if T can be 
extended to a commuting tuple of normal operators on a larger Hilbert space, 
i.e., if there is a tuple T" = (T[, . . . , T^) of commuting normal operators on 
a Hilbert space £7' such that E' contains E and the T[ leave 1£ invariant 
and satisfy T[\e = T{ for i = 1, . . . , n. Note that subnormal is equivalent to 
normal when dim(E) < oo. For details see [1]. 

According to the Halmos-Bram-Ito criterion (see [9]), the commuting tu- 
ple T = (T\, . . . ,T n ) is subnormal if and only if 

Y,(T a ^,T^ a ) > 
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for all finitely supported families {CaJaeZ" in E. Using this criterion we 
show: 

Proposition 3.7. Let T = (Ti,...,T n ) be a commuting tuple in B{E). 
Then T is subnormal if and only if E C Mt- 

In other words, the tuple T is subnormal if and only if VtGpI 2 ) > holds 
for every p £ C[z,z]. 

Proof. Assume iPt(\p\ 2 ) > for every p £ C[z,z]. To prove that T is 
subnormal we can, using a result of Stochel ([213, Cor. 3.2), assume that 
there exists a cyclic vector £ for T, i.e. the linear span of {T a £: a £ Z™} is 
dense in E. It suffices to verify the Halmos-Bram-Ito condition for all finite 
families {£ a } lying in the linear span of {T Q £: a £ Z™}. So let £ a = p a (T)£ 
where p a £ C[z] for a G Z™ (and p a = for almost all a), and consider 
P : =E Q Pa( z )z Q eC[z,z]. Since 

\ P \ 2 = Y.p^)pW)*^, 

Q,/3 

the assumption E C My gives 

0< (M\P\ 2 %0 = ^<T^ Q (T)e, T a p p {T)C) = "£( T ^, T a ^>, 

which shows that T is subnormal. Conversely, the same argument shows 
that T subnormal implies E C Mt- D 

3.8. We shall consider the following properties of an ideal I C R[x,y]: 

(S) (Subnormality) Every commuting tuple T = (T 1; . . . , T n ) of bounded 

linear operators in a Hilbert space satisfying p(T,T*) = for every 

p £ I is subnormal. 

(Sf) (Finite rank subnormality) Every commuting tuple T = (T\, . . . ,T n ) 

of complex matrices satisfying p(T, T*) = for every p £ I is normal. 

Trivially (S) implies (Sf). Condition (Sf) will be considered in the next 
section. Here we first show that condition (S) is necessary for the Quillen 
property (Q). (This fact was announced without proof in [T5] Corollary 2.2). 

Proposition 3.9. For any ideal I C M[x, y], Quillen property (Q) implies 
the subnormality condition (S). 

Proof. Assume (Q) holds for /. Given a commuting tuple T of bounded 
operators with / C ker(^r), we have E^ + 1 C Mt- Since My is closed with 
respect to the finest locally convex topology of M[x, y] (|3.5p . it follows from 
(Q) that Mt contains every polynomial that is nonnegative on Vk(I). In 
particular we have E C Mt, which implies that T is subnormal (Proposition 
I37FD. D 



Remark 3.10. In the case when Vr(I) is compact, we can give a very short 
proof of Proposition 13.91 using Athavale's theorem [2]. Indeed, assume that 
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is compact and (Q) holds for /. After suitably scaling the variables 
we can assume |£j| < 1 for every £ = (£i,...,£n) £ Vr(J). Let T be a 
commuting tuple of bounded operators satisfying I C ker(V"r)- In order to 
show that T is subnormal it suffices, by [2 J Theorem 4.1, to show for any 
tuple a = (a±, . . . , a n ) of nonnegative integers that 



/ := na-is/r G M ^' 



Now / > on Vr(-T), so the assumption on I implies /sS/,+1, from which 
■0t(/) > is obvious. 

Remark 3.11. The subnormality property (S) on an ideal I C R[x, y] is 
strictly weaker than the Quillen property (Q). An immediate example to 
show this is given by the ideal I = (x\, . . . , x n ) = (zj + Zj : j = 1, . . . , n) in 
R[x, y]: Every commuting tuple T of operators with I C ker(^T) consists 
clearly of normal operators. On the other hand, for any n > 2 there exist 
strictly positive polynomials on Vjr(I) = MJ 1 that are not even sums of usual 
squares, and a fortiori not of hermitian squares. For instance, adding a 
positive constant to the well-known Motzkin polynomial 2/f J/1 +J/i J/2 — 3y ^yf + 
1 gives such an example. 

It is less straightforward to find an ideal / satisfying (S) but not (Q), for 
which Vr(I) is compact. Let f(z,z) = be the equation of an ellipse that 
is not a circle. Then every bounded operator T satisfying f(T,T*) = is 
subnormal, that is, the principal ideal I = (/) satisfies (S). But / does not 
have the Quillen property, see Theorem 15.41 below, and also |15j . 

4. Normal tuples of matrices 

In this section we will provide a complete geometric characterization of 
the ideals satisfying finite rank subnormality (Sf). More specifically, we 
will explicitly list those ideals that are maximal with respect to not satisfy- 
ing(Sf). 

First we need some preparation. It seems more natural here to work with 
♦-invariant ideals of C[z,z], rather than with ideals of R[x, y]. 

4.1. Given o / 6 in C n , let I(a,b) C C[z,z] be the ideal consisting of all 
polynomials /(z,z) with 

f(a,a) = f(b,b) = f(a,b) = f(b,a) = 0. 

Clearly, I (a, b) is *-invariant. As an ideal in C[z, z], note that I(a, b) is gener- 
ated by the polynomials p(z) and p(z)*, where p(z) G C[z] is a holomorphic 
polynomial satisfying p(a) = p(b) = 0. 

These ideals were introduced in [T3], where I(a,b) was denoted by J a ^. 

4.2. The usual inner product on the space of hermitian n x n matrices will 
be denoted by {S,T) := tv(ST). Given a G C n and a complex hermitian 
n x n matrix U 7^ 0, let J(a, U) be the set of all / G C[z, z] such that 
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(l)/(o,5) = 0, 

(2) U-V z f(a,a) = U-V^f(a,a) = 0, 

(3) (U, V^/(a,a)> = 0. 

Here we denote the holomorphic resp. antiholomorphic gradient by 

\OZjJ j=l, ...,n \OZjJ j=l,...,n 

(regarded as column vectors), and the mixed Hessian (Levi form) by 



ZZJ v dzjdzkJj,k=i, 

It is easy to see that J (a, U) is a *-invariant ideal in C[z, z]. 

Example 4.3. With a view toward the proof of Theorem 14.41 below, let us 
consider the following example. Fix an integer r > 1 and column vectors 
w\, . . . ,w n £ C' r , not all of them zero. Moreover, let a = (oi, . . . , a n ) £ C™, 
and let 

Tj = ( aj ° T ) G M r+1 (C) 

(we are using a (l,r) block matrix notation). Clearly, T = (T\, . . . ,T n ) is 
a commuting tuple of matrices, and is not normal since Wj ^ for at least 
one j. A straightforward calculation shows ker('0y) = J(a,U), where U is 
the nonnegative hermitian n x n-matrix 

Note that the rank of U is the dimension of the linear span of w\, . . . , w n in 

C. 

Next comes the main result of this section. It gives a complete ideal- 
theoretic characterization of condition (Sf): 

Theorem 4.4. Let I C C[z,z] be an ideal. The following are equivalent: 

(Sf) Every commuting tuple T = (T\, . . . ,T n ) of complex matrices satis- 
fying I C ker(^r) is normal; 

(G) I is not contained in I [a, b) for any pair a ^ b in C n , and neither in 
J(a, U) for any a G C n and any nonnegative hermitian nxn matrix 

4.5. We prove the implication (Sf) =>■ (G) by contraposition. More precisely, 
we will show: 

(a) For any a ^ b in C", there exists a commuting non-normal n-tuple 
T of 2 x 2 matrices with ker(^y) = I (a, b). 

(b) For any a £ C n and any nonnegative hermitian nxn matrix U ^ 0, 
there exists a commuting non-normal n-tuple Tofmxm matrices 
with ker(f/>r) = J(a, U). (We can take m = rk(L'") + 1 here.) 



10 MIHAI PUTINAR AND CLAUS SCHEIDERER 

In fact, (b) has already been proved by Example 14.31 (The last assertion 
comes from the fact that a nonnegative hermitian matrix U of rank r > 1 
can be written U = W*W with W £ M rxn (C).) Assertion (a) will be proved 
in 14.61 and 14.71 The reverse implication (G) =4> (Sf) will be proved in 14.81 

4.6. Let a ^ b in C n . Fix two linearly independent vectors u, v in C 2 that 
are not perpendicular. Let Tj £ M2(C) be the matrix satisfying TjU = ajU 
and TjV = bjV (j = l,...,n). Then T = (Ti,...,T n ) is a commuting 
tuple of matrices. Clearly, the matrix Tj fails to be normal for any index 
j with dj 7^ bj, and in particular, the tuple T is not normal. We claim 
ker(f/;j i ) = I(a,b). The inclusion I(a,b) C ker(^) is obvious. Since the 
vector space C[z, z]/I(a, b) has dimension 4, we have to show that the linear 
map ipT'- C[z,z] —7- M2(C) is surjective. This in turn follows immediately 
from the following lemma. 

Lemma 4.7. Let S £ M2(C). The matrices I, S, S* , S*S are linearly 
dependent if and only if S is normal. 

Proof. Let W s be the linear span of /, S, S* and S*S in M 2 (C). We have 
Ws = Ws-xi for every A £ C Since S — XI is normal iff S is normal, we 
can replace S by S — XI for any A £ C. In particular, we may do this for A 
an eigenvalue of S. After changing to a suitable orthonormal basis we can 
therefore assume S = { f) where a, b £ C. For this matrix it is immediate 
that Ws 7^ M2(C) if and only if a = 0, if and only if S is normal. □ 

4.8. We now show that (G) implies (Sf) in Theorem 14.41 again by contra- 
position. To this end let E be a finite-dimensional Hilbert space, and let 
T = (Ti, . . . ,T n ) be a commuting tuple of endomorphisms of E such that 
at least one Tj is not normal. We'll show that the ideal ker(^r) of C[z,z] is 
contained in one of the ideals I (a, b) or J (a, U), as in (G). 

Let F be any T-invariant subspace of E (that is, TjF C F holds for 
each j), and let T\F denote the restriction of T to F. So T\F is a commuting 
tuple of endomorphisms of F. Let i: F — > E be the inclusion map and 
it: E — > F the orthogonal projection onto F, and let 

p: End(E) ->• End(F), p(S) = iroSoi. 

For S £ End(-E) we have (S'jF)* = p(S*). Moreover, if S leaves F invariant, 
then p(S'S) = p(S')p(S), p(S*S') = p(S)*p(S') hold for any S' £ End(£). 
As maps C[z, z] — > End(-F), we therefore have ipT\F = P° ipT- In particular, 
ker(tpT) <= ker(^|^). In order to prove what we want, we can therefore 
replace E and T by F and T\F whenever F is a T-invariant subspace of E 
for which T\F is not normal. 

For any tuple a = (oi, . . . , a n ) £ C ra , denote by 

E(T,a) = {£ £ E: {Tj - aj )£ = for j = 1, . . . ,n} 

resp. by 

EooCT, a) = U G #: (2} - a,)^^ = for j = 1, . . . , n} 
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the a-eigenspace resp. the generalized a-eigenspace of T. These are T- 
invariant subspaces of E, and E = © aeC n Eoq(T, a). Since T is not normal, 
one of the following two situations occurs: 

(1) One of the Tj is not diagonalizable; 

(2) each Tj is diagonalizable, but for at least one index j there are two 
eigenspaces of Tj that are not perpendicular. 

Let us first discuss case (2). By assumption we have E = © aeC n E(T,a), 
and there exist a ^ b in C n such that E(T, a) and E(T, b) are not perpendic- 
ular. Pick vectors x £ E(T, a) and y £ E(T, b) that are not perpendicular. 
The two-dimensional subspace F spanned by x and y is T-invariant, and T\F 
is not normal. By the argument used in 14.61 we see that ker(^ r i^) = I (a, b). 
So we are finished with case (2). 

Now we discuss case (1) and assume that one of the Tj cannot be diago- 
nalized. Then there exists a £ C n with E(T, a) ^ E^T, a). Replacing E by 
-Eoo(T, a) and Tj by Tj — a,j for each j (the latter corresponding to a change 
of variables Zj —> Zj — dj in the polynomial ring) , we can assume that each 
Tj is nilpotent and Tj ^ for at least one j. Let c > 2 be the highest order 
of nilpotency among the Tj, that is, assume T? = for all j and T c_ ^ 
for one index jo. Replacing E by ker(T c_ ) we can assume Tj = for all j. 

Let Vj = ker(Tj) for j = 1, . . . ,n. Whenever there are two indices j, k 
with Vj ^ Vk, we can replace E by Vj. Iterating this step we arrive at 
the case where all nonzero operators among T\, . . . , T n have the same kernel 
V 7^ E. Thus, for each j, we have either Tj = or im(Tj) C ker(Tj) = V, 
and the latter occurs for at least one index j. 

Choose a nonzero vector x £ V^. The subspace F := Cx © V of E is 
T-invariant, and we can replace E with F. Put yj = TjX (j = 1, . . . , n), and 
let W C V be the linear span of yi, . . . , y n . We can replace E by Cx © W, 
and have now arrived at a minimal non-normal tuple of operators. 

Let r = dim(W), so 1 < r < n. Fixing an orthonormal linear basis of W, 
we represent the operators Tj by (r + 1) x (r + 1) matrices as 

/ • • • 0\ 
yy o ••• 

{yn o ... oy 

Let Wj = (yij, . . . ,y r j), regarded as a column vector (j = 1, . . . , n), and let 

U = (^>fe)i< ilfc <„ 

a psd hermitian matrix of rank r. From Example 14.31 we see ker(^j') = 
J(a,U). 

This completes the proof of Theorem I4.4L □ 

Remark 4.9. Ideals of the form I(a,b) or J(a,U), as in 14.41 are pairwise 
incomparable with respect to inclusion, except that J(a, U) = J(a, cU) for 
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every real number c > 0. To see that J(a, U) C J(a, U') implies U' = 
cU with c > 0, observe that the mixed Hessians of elements of J(a, U) 
are precisely the matrices that are orthogonal to U (condition (3) of I4.2H . 
Therefore U is determined by J {a, U) up to (positive) scaling. 

So we see that the ideals I(a, b) and J(a, U) are precisely the maximal 
ones among the ideals of relations between non-normal commuting tuples of 
matrices (in the sense of hereditary calculus) . 

Remark 4.10. A complex square matrix may be non-normal for two rea- 
sons: It may fail to be diagonalizable, or it may have two non-perpendicular 
eigenvectors for different eigenvalues. The ideals J(a, U) and I(a, b) in 
Theorem 14.41 correspond to these two possibilities. More precisely, if T = 
(Ti, ... ,T n ) is a commuting tuple of matrices, and if one of the Tj is not 
diagonalizable, then ker(^y) C J(a,U) for some pair (a,U). On the other 
hand, if the Tj are diagonalizable but one of them has two non-perpendicular 
eigenspaces, then ker(^y) C I(a,b) for some pair (a, 6). Both assertions are 
clear from the proof in 14.81 



Remark 4.11. Consider commuting tuples T = (Ti, . . . , T n ) in B(E) where 
E is a complex Hilbert space, together with the associated maps ipT ■ C[z, z] — ) 
B(E) given by hereditary calculus (|3.5p . It is a consequence of Fuglede's 
theorem that the tuple T is normal if and only if ifo is a ring homomorphism. 
As a consequence of Theorem 14.41 we can add another characterization, as 
long as E has finite dimension. It shows that the normality of a commuting 
tuple T of matrices can be decided from its ideal ker(V"r) of relations: 

Corollary 4.12. A commuting tuple T of matrices is normal if and only 
if the ideal ker(^r) is not contained in I{a, b) for any a ^ b in C n , and 
neither in J(a, U) for any a G C n and any nonnegative hermitian n x n- 
matrix U ^ 0. 

Proof. Indeed, if T is normal, there is an orthogonal basis of simultaneous 
eigenvectors. This implies that ker(?/V) is an intersection of finitely many 
ideals m a = {/ G C[z, z] : f(a,a) = 0}, a G C n . Such an intersection is never 
contained in any of the ideals I (a, b) or J (a, U). □ 

Remarks 4.13. 

1. Up to holomorphic linear coordinate changes there exist precisely n 
essentially different ideals J(a,U) in C[z,z]. Indeed, we can assume that 
o = and that 

U = U r := diag(l,...,l,0,...,0) 

is the diagonal matrix of rank r, where 1 < r < n can be arbitrary. In this 
case, J(0, U r ) consists of all / G C[z, z] which are modulo (zi, . . . , z n ) 2 + 
(zi, . . . , ~z n ) 2 congruent to 

n n 

Yl ( h i z i + b 'j^ + Yl c i k z ^ k 

j=r+l j,k=l 
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with bj, b'p Cjk S C and 

en H + c rr = 0. 

A system of generators for the ideal J(0, U r ) is therefore given by the fol- 
lowing list of polynomials: 

ZjZ k , ZjZ k 1 < j < k < r, 

ZjZ k , z k Zj 1 < j < k <r, 

\zj\ 2 - \z j+1 \ 2 l<j<r, 

Zj, ~Zj r + 1 < j < n. 

2. In [14], the ideals 

J a ,a = (z 1 -ai,...,z n -a n ) 2 + (zi-a 1 ,...,^ n -a n ) 2 (o E C") 

of C[z,z] were used. They ideals relate to the ideals J(a,U) studied here 
via 

Ja,a = f]j(a,U), 

U 

intersection over all nonnegative hermitian matrices U ^ 0. In particular, 
in the one variable case (n = 1) we have J a<a = J (a, 1). 

As a consequence of Theorem 14.41 and Proposition 13.91 we obtain: 

Corollary 4.14. Let I C R[x,y] be an ideal, and assume that I C I(a,b) 
for some a ^ b in C n , or that I C J(a, U) for some a G C™ and some 
nonnegative hermitian nxn matrix U ^ 0. Then there exists f £ R[x, y] such 
that f > on V^(I), but f is not a hermitian sum of squares modulo I. □ 

In the first case of Corollary 14. 141 the assertion was already proved in [14] 
Proposition 3.1, by a different argument. 

5. Examples 

We start by identifying some classes of (principal) ideals that satisfy the 
subnormality condition (S). 

Proposition 5.1. Let f = f* £ C[z,z] be of the form 

r 

f = Re 5 (z)-]T| % (z)| 2 
fe=i 

where g, q\, . . . , q r E C[z]. Assume for every j = 1, . . . ,n that Zj is a poly- 
nomial in g, qi, . . . , q r , that is, 

C[g, qi,...,q r ] = C[zi, . . . , z n ]. 

Then every commuting tuple T satisfying f(T,T*) = is subnormal. In 
other words, the principal ideal I = (f) has property (S). 
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Proof. Choose a real number c > so large that the operator A := g(T)+cid 
is invertible. From 2c Re(g) = \g + c\ 2 — c 2 — \g\ 2 we get 

2c/ = \ g + c \ 2 -c 2 -\g\ 2 -2c^2\q k \ 2 . 

k 
This implies 

A* A = c 2 id + g(Ty g (T)+2cY J <lk{TTq k (T), 

k 
hence suitable scalings of the commuting operators 

A-\ g(T)A-\ qi(T)A-\ ..., q^A' 1 

satisfy the identity of the sphere. Therefore the tuple consisting of these 
operators is subnormal, by Athavale's theorem [2]. Using rational functional 
calculus in conjunction with the spectral inclusion theorem [13], we conclude 
that the tuple (g{T), qi(T), . . . , ^(T)) commutes and is subnormal. Now 
the hypothesis implies that the tuple T = (Ti, . . . ,T n ) is subnormal. □ 

5.2. We discuss yet another class of identities that entail the subnormality 
condition (S), this time in one variable (n = 1). Let / G M[x,y] have the 
form 

/ = \g( z )\ 2 -a-\l(z)\ 2 -J2Mz)\ 2 

fe=i 
where a > is a real number, g, q\, . . . , q r £ C[z] are arbitrary polynomials 
and I € C[z] has degree one. The identity f(T,T*) = implies g(T)*g(T) > 
al, and we conclude that g(T) is invertible. Inverting g(T) we again arrive 
at a sphere identity, and arguing as in 15.11 we conclude in particular that 
l(T) is subnormal, whence T is subnormal. 

This construction can also be performed in any number of variables. 

5.3. In certain cases we can prove that 14«(/) is compact and S^ is not 
archimedean modulo /, for / as in 15.21 Indeed, let 

m— 1 



/ = M 2m -E 



Uj\Z 



2.7 



with m > 2 and real coefficients ao, ■ • • , a m -i > 0. Then S^ + (/) is not 
archimedean. Indeed, assume c — \z\ 2 + fg G X/j, with c£R and g = g* G 
C[z,z]. Let bj be the coefficient of \z\ 2 i in g. For any j > 0, the coefficient 
of l^l 2 - 5 in c — \z\ 2 + fg is > 0. For j = 1 this gives 

- 1 - ai&o - ao^i > 0, (2) 

while for j = m + k with k > it gives 

6fc-a m _i6 fc+ i a 6fc+m > (fc > 0). (3) 

Let I > be the largest index for which b\ ^ (by (|2|), there has to be 
such I). From (J3J) for A; = Z we get b\ > 0. By a downward induction, 
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repeatedly using (|3|), we conclude that bk > holds for all k > 0. But this 
contradicts ([2]). On the other hand, property (2) holds as soon as ao > 
and a\ > 0, see 15.21 The zero set Mr(/) is the union of (at least one, at most 
m — 1) concentric circles around 0. 

Next, we look at the simplest case, which is plane conies. The following 
theorem shows that we can completely decide in which cases the various 
properties discussed so far are satisfied. In particular, it turns out that 
properties (S) and (Sf) are equivalent for plane conies: 

Theorem 5.4. Consider a nonconstant polynomial 

f = azz + az + az + j3z + (3z + c 

with a, c £ K and a, (3 E C, and let (/) be the principal ideal generated by 
f in M.[x,y]. 

(a) (/) has the Archimedean property (A) if, and only if, a = and 
a/0. 

(b) (/) has the Quillen property (Q) if, and only if a = 0. 

(c) Properties (S), (Sf) and (G) for the ideal (/) are equivalent among 
each other, and are also equivalent to fa / V a = a = 0). 

Note that the result [15J Theorem 3.3 on the ellipse is contained in (c) as 
a particular case. 

For the proof of the theorem we need the following simple observation. 
(A similar argument was used in [15], proof of Proposition 3.1.) The leading 
form lf(/) of ^ / G C[z,z] is the nonvanishing homogeneous part of / of 
highest degree. Clearly lf(fg) = lf(/)lf(s) and lf(/*) = lf(/)*. 

Lemma 5.5. (n arbitrary) For ^ f = f* £ Y>h we have lf(/) 6S(,. In 
particular, when n = 1, this implies If (/) = a(z~z) m where deg(/) = 2m and 

0<oeR. 

The lemma is obvious since in a sum £\. |c/j(z)| 2 , no cancellation of leading 
forms can occur. 

Proof of Theorem \5.4\ Assume a = and a/0. Then the identity 

af = \az + ]3\ 2 + (ac-\[3\ 2 ), 

combined with Proposition 13. 2\ shows that S/j + (/) is archimedean and 
(hence) contains every polynomial g with g > on Vk(/). If a = a = 
then / is linear, and after a holomorphic change of variables we may assume 
/ = ^(z — z) = y. By Remark 13.41 1 we see that S^ + (/) contains every 
g£R[x,y] with<?>0on V M (/). 

Conversely, we show that S^ + (/) cannot contain all polynomials strictly 
positive on Vr(/) when a/0. Indeed, assume a/0 and choose 7 G C with 
7 ^ Mr(/ )• For sufficiently small real r > 0, the polynomial g = \z—j\ 2 —r 2 is 
strictly positive on Vr(/). Assuming g G S^ + (/) would mean g + fh G £/i 
for some /i G C[z,z], and necessarily ft ^ 0. When /i is constant then 
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l(g + fh) contains Xz 2 for some A 7^ 0, contradicting Lemma 15.51 Otherwise 
deg(h) > 0, and then lf(/) divides lf(//i) = If (<? + fh), again contradicting 



We have thus proved (a) and (b). For the proof of (c) we easily dispense 
with the linear case a = a = 0, and can assume deg(/) = 2. If a = 
then / = g + g* with a quadratic holomorphic polynomial g G C[z\. For 
generic choice of t € R there are two different numbers a 7^ /3 in C with 
g(a) = g((3) = it. For any such pair we have f(a) = f((3) = /(a, /3) = 0, 
and hence / S I(a,j3). For a = 0, therefore, the ideal (/) does not satisfy 
condition (G), and a fortiori does not satisfy conditions (S) and (Sf), by 



On the other hand, assume a/0. Then / has the form 

/ = Keg(z) + a\z\ 

with g 6 C[z]. According to Proposition 15. 1\ every bounded operator T 
satisfying f(T,T*) = is subnormal. So (/) satisfies conditions (S), (Sf) 
and (G) in this case. □ 

Remarks 5.6. 

1. We rephrase part of Theorem 15.41 in geometric terms, and assume 
deg(/) = 2 and Vr(/) / for simplicity. Then S^ + (/) contains all 
polynomials positive on Vr(/) if and only if Vr(/) is a circle. On the other 
hand, the identity f(T,T*) = implies subnormality for a bounded opera- 
tor T if and only if Vr(/) is not a hyperbola with perpendicular asymptotes 
(and neither a union of two perpendicular lines). 

2. From Theorem l5.4l we see in particular that there exist ideals I C M[x, y] 
with Vr(/) = for which —1 ^ S^ + 1. This is in striking contrast to the 
case of usual sums of squares, where it is well known that Vr(/) = implies 
— 1 E E + J. Such ideals may well have the subnormality property (S). For 
example, this is so for / = (ax 2 + by 2 + c) with a, 6, c > and a ^ b. 

5.7. For (reduced) plane conies, the normality condition (Sf) for finite- 
dimensional Hilbert spaces already implies the subnormality condition (S) 
for arbitrary Hilbert spaces, as shown in !5.4i We now show that this ceases 
to hold when we take a suitable nonreduced version of a conic. 
To this end consider the *-invariant ideal 

J = ({z-l){zz-l), (z-l)(zz-lj) 

in C[z,z], respectively its real version 

I = JnR[x,y] = (x 2 + y 2 -l) -(x-l,y). 

The ideal corresponds to the unit circle with nilpotents added at one point. 
We will see that hermitian sums of squares modulo I behave quite different 
than modulo \fl. 

Let T G B(E) satisfy I C ker(V"r), that is, 

(T*-id)(T*T-id) = 0. (4) 
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We decompose the Hilbert space as E = ker(T — id) ® ker(T — id) -1 . With 
respect to this decomposition, T has a block matrix representation 

id A 
B 

with ker(^) n ker(5 - id) = {0}. From Q we deduce A(B - id) = and 
A* A + (B*B - id) (5 - id) = 0. The second identity implies that B - id 
is actually injective. If dim(E) < oo, then B — id is invertible, and we get 
A = and B*B — id = 0. In short, T is unitary. Every (finite-dimensional) 
matrix annihilated by the ideal J is therefore unitary, and hence normal. 

On the other hand, we will produce an operator T acting on E = £ 2 (N) 
such that T is annihilated by the ideal J and T is not subnormal. Let E 
have Hilbert basis e& (k > 0), and let S: e^ i— > e^+i (k > 0) be the unilateral 
shift. Let 7r be the orthogonal projection onto the space generated by eo, 
and define T = S + it. A direct computation, supported by the relations 

S*S = id, SS* = id — 7r, ttS = 0, 

yields (T* — id) (T*T — id) = 0. But T is not subnormal. Indeed, any 

subnormal operator X satisfies the hyponormality inequality [X*,X] > 0, 

evident from the matrix decomposition of a normal extension N = ( oz ) 

and the equation [N*,N] = 0. The commutator [T*,r] = tt — Sir — ttS* 

( 1 -A 
acts on the span of eo and e\ as I n J , and this is not a nonnegative 



v -1 / 
operator. In summary, this shows: 

Proposition 5.8. Let I C M[x,y] be the above ideal, corresponding to the 
unit circle with a thickened point. Then I satisfies condition (Sf), but not (S) 
(and a fortiori, not (A)). In contrast, its reduced version \fl satisfies (A) 
(and therefore also (S) and (Sf)). □ 

6. Semi-algebraic sets 

6.1. In Section[3]we studied the question whether every polynomial strictly 
positive on a real algebraic set X C C n is a hermitian sum of squares 
on X. We now extend this question to (real) semi-algebraic subsets of 
C n . Algebraically, this means that instead of an ideal / C R[x, y] and the 
semiring E^ + I we consider hermitian modules, that is, modules over the 
semiring E/j (see l2.4p . This means that the real algebraic set X = Vr(I) is 
replaced by the closed set 

X M = {aeC:V/eM /(a) > 0} 

(see !2.6p . If the hermitian module M is finitely generated (or, more generally, 
if the quadratic module generated by M is finitely generated) , the closed set 
Xm is basic closed, i.e., there are finitely many fi,--.,fk £ ^[x,y] with 
X M = {a £ C n : h(a) > 0, . . . , f k (a) > 0}. 
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6.2. Each of the four properties of an ideal I C R[x, y] labelled (A), (Q), (S), 
(Sf) that were discussed in the first part of this paper is in fact a property 
of the semiring S = Eft, + I, i.e., can be expressed in terms of S. We now 
extend these properties to arbitrary hermitian modules M C R[x, y]: 

(A) M is archimedean; 

(Q) M contains every / G R[x, y] with / > on Xm', 

(S) every commuting tuple T of bounded operators in a Hilbert space 
satisfying p(T,T*) > for every p £ M is subnormal; 

(Sf) every commuting tuple T of complex matrices satisfying p(T,T*) > 
for every p £ M is normal. 
When M = S/j + / for some ideal I, the above properties agree with the 
respective properties of the ideal /, as defined in 13.11 and 13.81 

We start by the following characterization of archimedean hermitian mod- 
ules, thereby generalizing part of Proposition 13.21 

Lemma 6.3. A hermitian module M C R[x, y] is archimedean if and only 
if c — ||z|| 2 G M for some real number c. 

Proof. Assuming c — | |z| | 2 G M we have to show R + M = R[x, y]. For this 
let A := {p G C[z] : - \p\ 2 G R + M}. It suffices to prove A = C[z]. Indeed, 

qk G A for every k, then / G R + M. 



any / G R[x, y] can be written / = ]T\ l^l 2 - ^ fc | % | 2 with p^, q k G C[z]; if 



From c — |zj| = (c — ||z|| ) + Ylk^j \ z k\ we see z j e ^ for j = 1, 



, n. 



Therefore (and since C C yl) it is enough to prove that A is a ring. From 
a - |/| 2 , 6 - \g\ 2 G M with a, b > we get 

a6-|/ 5 | 2 = a (6-| 5 | 2 ) + | 5 | 2 (a-|/| 2 ) G M, 

so A is closed under products. From \f + g\ 2 + \f — g\ 2 = 2(|/| 2 + |g| 2 ) we 
see that A is also closed under sums. The lemma is proved. □ 

Before we start discussing a Positivstellensatz for hermitian modules, we 
need to mention a subtle point. The archimedean Positivstellensatz !2.6l holds 
for modules over archimedean semirings, but not in general for archimedean 
modules over semirings. This distinction is relevant for hermitian modules, 
as the following example shows: 

Example 6.4. The hermitian module M = £^+£^(1— ||z|| 2 ) is archimedean 
by Lemma 16.31 But there exist polynomials that are strictly positive on the 
closed unit ball Xm and are not contained in M. In fact, e + (1 — ||z|| 2 ) 2 is 
such a polynomial for < e < 1. To see this, assume 

e+(l-||z|| 2 ) 2 = p + g(l-||z|| 2 ) 

with p, q G S/j. Comparing constant coefficients gives q(0) < 1 + e, while 
comparing coefficients of z\Z\ gives —2 > —q(0), i.e. q(0) > 2, since the 
coefficient of z\Z\ in any hermitian sum of squares is nonnegative. 

The point is that, although the hermitian module M is archimedean, M 
is not a module over any archimedean semiring. 
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The proper "quantization" of the module M is a linear operator T acting 
on a Hilbert space, subject to the contractivity condition 

I-T*T > 0. 

It is clear that not every contractive operator T is subnormal. 

Example 16.41 has shown (c.f. also Proposition I3.2p : 

Lemma 6.5. Consider the following two properties of a hermitian module 
M mR[x,y]: 

(i) Xm is compact, and M has the Quillen property (Q); 
(ii) M is archimedean (A). 

Then (i) implies (ii), but the converse fails in general. □ 

Here we are mainly interested in a Positivstellensatz, that is, in the Quillen 
property (Q), in the case when Xm is compact. Therefore, we will often 
assume that M is archimedean (which implies that Xm is compact), and 
try to find additional properties for M that will imply the Positivstellensatz. 
Verifying the archimedean property of a concretely given hermitian module 
M is usually easy, using the criterion of Lemma 16.31 

One instance where we get the Positivstellensatz for free is the following: 

Proposition 6.6. Let I C R[x, y] be any ideal with the archimedean property 
(A). Then for any p\, . . . ,p r 6l[x,y], the hermitian module 

M = I + S ft + PiT, h -\ h Pr^h 

has the Quillen property (Q). 

Proof. M is a module over the archimedean semiring S^ + J, so the assertion 
follows from the archimedean Positivstellensatz 12.61 □ 



Remarks 6.7. 

1. For M as in 16.61 the associated semi-algebraic set is 

X M = Vr(I) n {a e C n : Pl (a) > 0, . . . , p r (a) > o}. 

2. In the particular case I = (1 — ||z|| 2 ), Proposition 16.61 was proved by 
D'Angelo and Putinar ([5], Thm. 3.1). 

Generalizing Proposition 13.91 the Positivstellensatz for M implies the fol- 
lowing subnormality property for commuting tuples T of bounded operators. 

Corollary 6.8. For any hermitian module M C R[x,y], we have (Q) => (S). 

Proof. The proof of Proposition 13.91 carries over (essentially) verbatim. □ 

We next discuss a refinement of the last corollary, in which we are going 
to weaken condition (Q) and strengthen condition (S). 

6.9. Let M C R[x,y] be a hermitian module. Recall [20J that M is said to 
have the Strong Moment Property if the following holds: 
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(SMP) Every linear functional L: M[x,y] —> R with L\m > is integration 
with respect to some positive Borel measure on Xm- 

Lemma 6.10. Let M C R[x, y] be a hermitian module. The Quillen property 
(Q) f or M implies the strong moment property (SMP) for M . The converse 
is true if M is archimedean. 

Proof. Assume that M has property (Q). Then any / € R[x,y] nonnegative 
on Xm lies in the closure M with respect to the finest locally convex topology 
on R[x,y]. So L\m > implies L(f) > 0, and therefore L is integration 
with respect to some positive Borel measure on Xm, according to the Riesz- 
Haviland theorem [Hj. Conversely, let M be archimedean and satisfy (SMP), 
and assume that there is / G R[x,y] with / > on Xm but / ^ M. By 
Eidelheit's separation theorem (e.g., [10] §17.1) there is a linear functional 
L: R[x,y] -)• R with L(l) = 1, L\ M > and L(f) < 0. By assumption, L 
is integration with respect to a probability measure [x on Xm- We conclude 
L{f) = f x f dfi > 0, a contradiction. □ 

Lemma 6.11. For M a hermitian module in R[x,y], consider the following 
property: 

(SOS) £ C M (closure with respect to the finest locally convex topology on 

R[x,y]j. 

Then property (SOS) for M implies property (S) for M . The converse is 
true if M is archimedean. 

Proof. (SOS) =^ (S): Let T be a commuting tuple of operators with M C 
Mt- Since Mt is closed in R[x, y] (|3.5p we have M C Mt- So assumption 
(SOS) implies X C Mt, and hence T is subnormal according to Lemma [3.71 
(S) =^ (SOS): Let M be archimedean and have property (S). Assume that 
there is / £ £ with / ^ M. There exists a linear functional L: R[x, y] — > R 
satisfying L\m > and L{f) < 0. We extend L to a complex linear func- 
tional on R[x,y] (g)R C = C[z,z] and perform a GNS construction: Consider 
the positive semi-definite inner product (p, q) := L(pq*) on C[z], and let E be 
the corresponding Hilbert space completion of C[z]. Since M is archimedean, 
there is a real number c > with c— \zj | 2 G M for j = 1, . . . , n, and it follows 
for any p £ C[z] that 

L(|z/|p(z)| 2 ) < c-L(|p(z)| 2 ). 

Hence multiplication by Zj induces a bounded linear operator Tj on E (j = 
l,...,n), and T = (T\,. . . ,T n ) is a commuting tuple in B{E). For g £ 
C[z,z] and q G C[z] we have (ipT(g)q, q) = L{g\q\ 2 ). So for g G M the 
operator tpT^g) is nonnegative, which means M C My. On the other hand, 
f i M T since ($r(/)l,l) = L(/) < 0. Therefore, the tuple T is not 
subnormal, according to Proposition 13.71 This contradicts property (S). □ 

Lemma 6.12. For any hermitian module M we have (SMP) =^- (SOS). 
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Proof. By hypothesis, any linear functional L: M[x,y] — > R with L\m > is 
integration with respect to a measure on Xm- ln particular, L(f) > for 
any / G £. This implies / G M. □ 

Remarks 6.13. 

1. The implication (S) =4> (SOS) for archimedean M (Lemma I6.1ip is 
uninteresting if M is a module over an archimedean semiring. Indeed, in 
this case we know anyway that M contains all polynomials strictly positive 
on Xm, and hence ECMis clear. But in the other cases, the equivalence 
of (S) and (SOS) is a new information. 

2. Altogether we have now obtained the chain of implications 

(Q) =>- (SMP) => (SOS) => (S) 

for any hermitian module M. When M is archimedean, the first and the 
last implication can be reversed. 

6.14. Under a stronger condition on M, we are now going to prove the 
implication (SOS) =4> (SMP), and hence the equivalence of (Q) and (S), 
when M is archimedean. Recall that a closed subset K C C™ is said to be 
polynomially convex if the following holds: For every £ G C n with £ £ K, 
there exists a polynomial p G C[z] with \p\ < 1 on K and |p(£)| > 1. We 
shall consider the following property for a hermitian module M C M[x, y]: 

(PC) For every (£C"\ Xm-, there exist / G M and q G S^ such that 
g < 1 on {a G C n : f(a) > 0}, and such that g(£) > 1. 

Remarks 6.15. 

1. If M satisfies condition (PC), then the set Xm is polynomially convex 
in C n . Indeed, Im has the form 



X M = f]{aeC n :q u (a)<l} 



for some family of polynomials q u G S^. For each f , the set {a G C n : q v (a) < 
1} is polynomially convex, since it is the preimage of the closed unit ball in 
some C m under a polynomial map C n — > C m . Therefore Xm is polynomially 
convex. 

2. Condition (PC) is satisfied when the hermitian module M is generated 
by polynomials of the form 1 — q v with q v G S/j. More generally, (PC) 
holds when M contains a family {/,,} of polynomials with J^ = Pli/{ a ^ 
C n : /i/(a) > 0} such that, for every i/, the set {/ v > 0} is polynomially 
convex. 

Theorem 6.16. Let M be an archimedean hermitian module in R[x, y] 
which satisfies condition (PC). Then the subnormality property (S) implies 
the Quillen property (Q) for M. 

Proof. By Lemma 16.101 it suffices to show that M has the strong moment 
property (SMP). Given a linear functional L: M[x,y] — > M with L\m > 0, we 
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need to show that L is integration with respect to a positive Borel measure 
supported on Xm- Similar to the proof of Lemma 16.111 we use a GNS 
construction to get a Hilbert space E together with a commuting tuple T = 
(Ti, . . . , T n ) in B{E) and a cyclic vector £, such that L(pq*) = (p(T)£, q(T)£) 
for all p, q E C[z]. Since M has property (S), the tuple T is subnormal. The 
spectral measure of a commuting normal extension S of T gives a Borel 
measure \x on C n with 

£(/) = (Mm,o = Jfdn 

for all / G R[x, y]. It remains to prove supp(/i) C Xm, which follows from 
the next lemma. □ 

Lemma 6.17. Let M be a hermitian module, and let fi be a positive measure 
on C n all of whose moments exist, satisfying J f d\x > for every f G M. 
If M satisfies condition (PC), then supp(fi) C Xm- 

Proof. Assume there exists £ G supp(//) with ^ ^ Xm- By (PC) we find 
/ G M and q G S/i such that (?(£) > 1 and g < 1 on {/ > 0}. We have 

ffq m d(l = f fq m dn+ [ fq m d^. 

For 77i — > oo, the first summand on the right tends to zero by the dominated 
convergence theorem. On the other hand, the second summand tends to 
-co: Consider a small ball B around £ on which q > a > 1 and / < b < 0, 
and note that B has positive //-measure. So there is m G N for which the 
integral on the left is negative, a contradiction since fq m G M. □ 

Summarizing Lemmas l6.10ff6TT2l and Proposition 16.161 we obtain: 

Theorem 6.18. Let M be a hermitian module in R[x, y] which is archimedean 
and satisfies condition (PC). Then for M we have 

(Q) ^ (SMP) <^ (SOS) <^ (S). □ 

Example 6.19. Without any hypothesis like (PC), the implication (S) => 
(Q) is false, even if we assume that M is archimedean. Indeed, let n = 1 
and < r < R, and consider the S^-module 

M = E + Z h (R 2 - \z\ 2 ) + Z h (\z\ 2 - r 2 ). 

Here Xm is the annulus around the origin with radii r < R. Clearly, M 
is archimedean (|6.3p and satisfies condition (S) (Lemma 16. lip . But there 
exists a compactly supported measure fi on C with supp(/x) % Xm and with 



f(z)Kdz) > 
for every / G M. Namely, let r < p < R, and let 

J fdn:= 6/(0) + J* f(pe u )dt. 
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When e > is sufficiently small, we have /(|^| 2 — r 2 )\p(z)\ 2 dp > for every 
p £ C[z], and hence j f{z) /j,(dz) > for every / £ M. Namely, the integral 
is 

(p 2 -r 2 ) T \p(pe u )\ 2 dt-er 2 \p(0)\ 2 > {2tt( P 2 - r 2 ) - er 2 )\p(0)\ 2 . 

J — TT 

Note that the annulus Xm is not polynomially convex, and so the hypotheses 
of Theorem 16, 161 are not satisfied. 

Theorem 6.20. Let M C M[x,y] be a finitely generated hermitian module. 
If M satisfies (Sf), then the semi- algebraic set Xm in C n does not contain 
an analytic disc. 

Proof. By an analytic disc we mean the image of a nonconstant holomorphic 
map <p: B — >■ C n . We can assume 93(0) = 0, and since we work locally, we 
can assume that there exists an analytic function F: 93(B) — > B such that 
F((p(C)) = ( (( £ B). By assumption, 99(B) C X M - Let M be generated 
by nonzero polynomials fi, ■ ■ ■ , f r £ lR[x, y]. We reorder the generators so 
that /1, . . . , f s vanish identically on 92(B), and f s +i, ■ ■ ■ , fr have only isolated 
zeros on 93(B). By passing to an appropriate subdisc on B we can assume 
/ s+ i(0)>0,...,/ r (0)>0. 

Let e > be sufficiently small, and choose a non-normal matrix A of 
norm \\A\\ < e. Then the commuting tuple of matrices <p(A) is not normal, 
as A = F(ip(A)) by the superposition property of the analytic functional 
calculus. In addition, M C M^on. Indeed, fixing 1 < j < s, the composite 
function fj(ip(z),ip(z)) is identically zero. If 






T<**P 



fj{z,z) = 2^c Q)/9 z a z 

a,0 

this means that the power series 

^C a ^ifi{z) ai ■ ■ ■ lf n (z) an ipi{z) * ■■■lf n {z) 
a,(3 



■Pn 



in z und ~z is identically zero, from which we see fj(ip(A)) = 0. On the other 
hand, f s+ i((p(A)) > 0, . . . , f r (<p(A)) > by the continuity of the functional 
calculus. 

Since (f(A) is not (sub-) normal, this implies that M does not satisfy (Sf). 

□ 

Remark 6.21. In the case where M = S^ + / for some ideal / C M[x,y], 
Theorem 16.201 also follows from Theorem 14.41 Indeed, assume that 93: B —t- 
C n is a holomorphic map with 93(B) C Vr(J) and with u := V 2 93(0) ^ 0. 
A direct calculation shows that / C J(a, U) (see 14. 2h with a := 93(0) and 
U := u*u (we consider u as a row vector). Therefore, the ideal / does not 
satisfy condition (G), and by Theorem 14.41 it neither satisfies (Sf). 

By Corollary 16.81 Theorem 16.201 implies: 
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Corollary 6.22. Let M be a finitely generated hermitian module. If Xm 
contains an analytic disc, then M does not satisfy Quillen's property (Q). 

□ 

Proof. Let M be generated by nonzero polynomials fi,...,f r £ M[x,y]. 
Since Xm contains an analytic disc, the subnormality condition (S) does 
not hold for M, by Theorem 16.201 above. According to Corollary 16.81 the 
Positivstellensatz does not hold either. □ 

7. Historical comments 

We feel that leaving aside the analytic roots of the questions encountered 
in this article would deprave the reader of some essential insight. We briefly 
describe below some of the old sources and applications of the decomposition 
of a real polynomial in a sum of hermitian squares. 

Start with Riesz-Fejer Theorem asserting that a polynomial p(z, z) which 
is non-negative on the unit circle can be decomposed as 

p(z,z) = \h(z)\ 2 + (l-\z\ 2 )g(z,z), (5) 

where h G C[z] and g £ C[z,~z]. 

Next we "quantize" the above setting, that is we replace the complex 
variable by a linear transformation. Let T be a bounded linear operator 
acting on a Hilbert space E and denote by T* its adjoint. The simple 
operator identity 

T*T = id 

defines an isometric transformation, with the known consequences: spectral 
picture, functional model and classification, see [3]. In particular, the oper- 
ator T has in this case spectrum contained in the closed unit disc D, and 
for every real valued polynomial p(z, ~z) the estimate 

p(T,T*) < maxp(A,A)id (6) 

agt 

holds true. Recall that here we adopt the hereditary calculus convention, 
putting the powers of T* to the left of the powers of T, in every monomial 
appearing in p. 

Inequality ([6]) is a simple consequence of ([5]): If p(z,~z) < M on T, then 

M-p(T,T*) = h{T)*h(T) + [(l-\z\ 2 )g(z,z)]{T,T*) = h(T)*h(T) > 0. (7) 

As a matter of fact, estimate (0) implies that the linear functional calculus 
p(z, ~z) i—)- p(T,T*) possesses an additional positivity property. The latter 
implies, essentially repeating F. Riesz construction of the representing mea- 
sure for a positive functional, that the operator T is subnormal, that is, 
there exists a larger Hilbert space E C K and a normal operator U acting 
on K, such that U(E) C E and U\e = T. By choosing U minimal with 
this property we can also assume that the spectrum of U is contained in the 
torus T, hence U is unitary, see for instance [4]. In particular, if in addition 
TT* = id, that is T is unitary from the beginning, we obtain in this manner 
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a proof of the spectral theorem, as advocated by F. Riesz from the dawn of 
functional analysis [171 [18] . 

Turning now to several complex variables, or their quantized form, com- 
muting tuples of linear operators, we encounter Quillen's idea [16 1 . Let 
P{z,~z) be a conjugation-invariant polynomial, bihomogeneous of the same 
degree in the variables z and z. Assume that P(z,z) > whenever z/0. 
Denote by M = (M Zl , . . . , M Zn ) the n-tuple of commuting multipliers by the 
complex variables, on the Bargmann-Fock space of entire functions (square 
integrable in C n with respect to the Gaussian weight). Using analytical tools 
(elliptic estimates and Fredholm theory), Quillen analyzes the positivity of 
the operator P(M,M*) inherited from the positivity of the symbol P. He 
reaches the purely algebraic conclusion that there exists a positive integer 
N and homogeneous complex analytic polynomials hx,...,hk such that 

\\zf N P(z,z) = \h 1 (z)\ 2 + --- + \h k (z)\ 2 . (8) 

Very recently Drout and Zworski [7] have obtained, using the same Bargmann- 
Fock space representation, degree bounds in Quillen's decomposition above. 

An elementary dehomogenization argument shows that Q implies that 
every positive polynomial on the unit sphere of C n is equal, on the sphere, 
to a sum of hermitian squares, as stated by condition (Q) in our article. 

On the abstract operator theory side, we mention the 1987 discovery 
of Athavale [2j stating that every commuting tuple of bounded operators 
T = (Ti, . . . , T n ) subject to the sphere identity 

2?Ti + • • • + T*T n = id 

is subnormal, and hence possesses a functional calculus with a positivity 
property of type ([!]). Athavale's work belongs to a framework advocated 
for several dozen years by now by Conway [4j, Agler and McCarthy [T| and 
their followers. 

Quillen's theorem was rediscovered in 1996, generalized and put into the 
context of Cauchy-Riemann geometry and function theory of several com- 
plex variables by Catlin and D'Angelo [3]. Their proof also uses analysis, 
this time employing analytic Toeplitz operators acting on the Bergman space 
of the unit ball. One of the main themes of research in Cauchy-Riemann 
geometry is the (local) classification up to bi-holomorphic transformations 
of real algebraic subvarieties of C n . There is no surprise that Quillen prop- 
erty, or better its algebro-geometric consequences (Sf) and (G) are relevant 
for CR manifold theory. A modest step into this direction was taken in [6 J . 
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